I j 

-54- 

What Is Claimed Is: 

1 . A methoti of selecting one or more compounds from a virtual combinatorial 
library associated with R reagent combinations, comprising the following steps: 

a. selecting a first set of N reagent combinations from the R reagent 
combinations associated with the virtual combinatorial library, 
wherein N < R; 

b. enumerating said first set of N reagent combinations to produce a 
first set\pf F enumerated compounds; 

c. selecting wl compounds from said first set of F enumerated 
compounds\ wherein M < F; 

d. deconvolutin&said M compounds into reagents; 

e. generating a focused library based on said reagents; and 

f. enumerating a plurality of reagent combinations associated with 
said focused library \o produce a second set of S enumerated 
compounds. \ 



2. The method of claim 1, further comprise 
g. selecting K compounds from 
compounds, wherein K < S. 



the step of: 

>aid second set of S enumerated 
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3. The method of claim 2, wherein step (g.) comprises selecting said K 
compoundslsuch that said K compounds can be produced using a predetermined 
number T ofVeagents. 

4. The method of claim 1, wherein step (a.) comprises selecting said first set of 
N reagent combinations substantially at random from the virtual combinatorial 
library. \ 

5. The method of claim l,\wherein step (a.) comprises selecting said first set of 
N reagent combinations suclMiat a substantially uniform coverage of a reagent 
space associated with the virtual combinatorial library is achieved. 

6. The method of claim 1, wherein step (a.) comprises selecting said first set of 
reagent combinations such that each reagent in the virtual combinatorial library is 
selected a substantially equal number of times. 

7. The method of claim 1, wherein step Qc.) comprises selecting said M 
compounds from said first set of F enumerated Compounds based on a fitness 
function. \ 

8. The method of claim 7, further comprising the step of: \ 

g. selecting K compounds from said second set of S enumerated 
compounds based on said fitness function, wherein^K < S. 
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9. The method of claim 7, further comprising the step of: 

g. \ selecting K compounds from said second set of S enumerated 
\ compounds based on said fitness function and such that said K 
Compounds can be produced using a predetermined number T of 
reagents, wherein K < S. 

10. The method of claim 8, wherein step (c) comprises 

i. initially selecting M compounds from said first set of 
enumerated compounds to produce a first sub-set of 
enumerated compounds; 

ii. evaluating sakl first sub-set of enumerated compounds 
based on said fitness function; and 

iii. refining said first siib-set of enumerated compounds based 
on said fitness function. 

1 1 . The method of claim 10, wherein step (g.) comprises 

i. initially selecting K compounds from said second set of 
enumerated compounds to produce a second sub-set of 
enumerated compounds; \ 

ii. evaluating said second sub-set of enumerated compounds 
based on said fitness function; and \ 
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7 iii.\ refining said second sub-set of enumerated compounds 

8 \ based on the fitness function. 

1 12. The method of claim 1 1 , wherein the fitness function is related to diversity of 

2 a collection of compounds, and wherein step (c.ii.) comprises evaluating the 

3 diversity of said first sub-set\pf enumerated compounds, and wherein step (c.iii.) 

4 comprises refining said first sub-set to increase the diversity of said first sub-set. 

*£jl 13. The method of claim 12, wherein step (g.ii.) comprises evaluating the 

J"j2 diversity of said second sub-set of enumerated compounds, and wherein step 

|L3 (g iii ) comprises refining said second siab-set to increase the diversity of said 

o4 second sub-set. \ 

|fl 14. The method of claim 8, wherein step (c.) comprises: 

2 i. characterizing each compound of said first set of 

3 enumerated compounds; \ 

4 ii . evaluating each characterized compound of said first set of 

5 enumerated compounds based on said fitness function; 

6 iii. ranking each characterized compoundVf said first set of 

7 enumerated compounds; and \ 

8 iv. selecting M compounds of said first set oi\enumerated 

9 compounds based on said ranking. \ 
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15. The method oiy claim 14, wherein step (g.) comprises: 

characterizing each compound of said second set of 
Venumerated compounds; 

evaluating each characterized compound of said second set 
of enumerated compounds based on said fitness function; 
ranking each characterized compound of said second set of 
enumerated compounds; and 

selecting K compounds of said second set of compounds 
based on sa\d ranking. 



n. 



in. 



IV. 



16. The method of claim 15, whereimstep (c.i.) comprises characterizing each 
compound of said first set of enumerated compounds using a set of molecular 
descriptors. 

17. The method of claim 16, wherein step (gX) comprises characterizing each 
compound of said second set of enumerated compounds using said set of 
molecular descriptors. 



18. The method of claim 15, wherein the fitness function ik related to similarity 
to one or more query structures, and wherein step (c.ii.) comprises evaluating 
similarity between each compound of said first set of enumerated compounds and 
the one or more query structures. 
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19. The methpd\pf claim 18, wherein at least one of the following similarity 
measures is used inWep (c.ii.) for evaluating similarity between each compound 
and the one or more query structures: 

(V) similarity in number of atoms, bonds and rings of 
\ the same types; 

(2) \ similarity in shape and surface characteristics; 

(3) Similarity in electron density distribution; 

(4) similarity based on common substructure; 

(5) similarity based on the presence and orientation of 
pharmacophoric groups; 

(6) similarity in binding affinity; and 

(7) similarity in degree of conformational overlap with 
a know receptor binder. 

20. The method of claim 1 8, wherein step (g.ii.j\eomprises evaluating similarity 
between each compound of said second set of enumerated compounds and the one 
or more query structures, and wherein the same similarity measure is used for 
evaluating similarity in step (c.ii.) and step (g.ii.)- \ 

2 1 . The method of claim 1 4, wherein the fitness function is related to at least one 
desired characteristic, and wherein step (c.ii.) comprises evaluating each 
compound of said first set of enumerated compounds to determine anfextent that 
each compound possesses the at least one desired characteristic. \ 
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1 22. The method of\claim 21, wherein the at least one desired characteristic 

2 comprises at least one\of the following: 

3 (u ) a desired physical property; 

4 (2\ a desired chemical property; 

5 (3)\ a desired functional property; and 

6 (4) \a desired bioactive property. 

|ft 23. A system for selecting, based on a fitness function, one or more compounds 

If 2 from a virtual combinatorial library Associated with R reagent combinations, 

ft comprising: \ 

means for selecting a first set of N reagenVcombinations from the R reagent 
u5 combinations associated with the virtuaPcombinatorial library, wherein R 

g6 <N; \ 

7 means for enumerating said first set of N reagent combinations to produce a first 

8 set of F enumerated compounds; \ 

9 means for selecting M compounds of said first set of F enumerated compounds 

10 based on the fitness function, wherein M < F; \ 

1 1 means for deconvoluting said M compounds into reagents; \ 

12 means for generating a focused library based on said reagents; \ 

13 means for enumerating a plurality of reagent combinations associated with said 

14 focused library to produce a second set of S enumerated compounds; 
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means for selecting K compounds of said second set of enumerated compounds 
based on the fitness function, wherein K < S. 

24. A computer program product comprising a computer useable medium having 
computer program logic recorded thereon for enabling a processor to assist in 
selecting, based on a fitiaess function, one or more compounds, from a virtual 
combinatorial library associated with R regent combinations, the computer 
program logic comprising: \ 

means for enabling a processor tOiSelect a first set of N reagent combinations from 

the R reagent combinations associated with the virtual combinatorial 

library, wherein N < R; \ 
means for enabling a processor to enumerate said first set of N reagent 

combinations to produce a first set of F enumerated compounds; 
means for enabling a processor to selecUM compounds of said first set of 

enumerated compounds based on the fitness function, wherein M < F; 
means for enabling a processor to deconvolute said M compounds into reagents; 
means for generating a focused library based on sard reagents; 
means for enabling a processor to enumerate a pluralityyof reagent combinations 

associated with said focused library to produce a second set of S 

enumerated compounds; and \ 
means for enabling a processor to select K compounds of said second set of 

enumerated compounds based on the fitness function, wherein K < S. 
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1 25. A method of selecting one or more compounds from an enumerated virtual 

2 combinatorial library associated with R enumerated compounds, comprising the 

3 following steps: \ 

4 a. selecting a first set of N enumerated compounds for the R 

5 enumerated compounds associated with the enumerated virtual 

6 combinatorial library, wherein N < R; 

7 b. selecting M compounds from said first set of N enumerated 
SB compounds, wherein MVc N; 

%=p c. deconvoluting said M compounds into reagents; and 

4fc> d. using said reagents to extract \n enumerated focused library from 

~£ \ 

fM said enumerated virtual combinatorial library, said enumerated 

^p2 focused library including S enumerated compounds. 

igl 26. The method of claim 25, further comprising the step of: 

2 e. selecting K compounds from said S enumerated compounds, 

3 wherein K < S. \ 

1 27. The method of claim 26, wherein step (e.) comprises selecting said K 

2 compounds such that said K compounds can be produced using a predetermined 

3 number T of reagents. \ 
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28. The method of claim 25, wherein step (a.) comprises selecting said first set 
of N enumerated compounds substantially at random from the enumerated virtual 
combinatorial library. \ 

29. The method of claim 25, wherein step (a.) comprises selecting said first set 
of N enumerated compounds sufch that a substantially uniform coverage of a 
reagent space associated with the enumerated virtual combinatorial library is 
achieved. \ 

30. The method of claim 25, wherein step (k.) comprises selecting said first set 
of enumerated compounds such that each reagent in the enumerated virtual 
combinatorial library is selected a substantially equal number of times. 

31. The method of claim 25, wherein step (b.) comWises selecting said M 
compounds from said first set of N enumerated compounds based on a fitness 
function. \ 

32. The method of claim 31, further comprising the step of: \ 

e . selecting K compounds from said S enumerated compounds based 
on said fitness function, wherein K < S. \ 



33. The method of claim 31, further comprising the step of: 
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e. selecting K compounds from said S enumerated compounds based 
on saicKfitness function and such that said K compounds can be 
produceausing a predetermined number T of reagents, wherein K 
<S. \ 

34. The method of claim 32, wherein step (b.) comprises 

i. initially selecting M compounds from said first set of 
enumerated Vompounds to produce a first sub-set of 
enumerated compounds; 

ii. evaluating said first sub-set of enumerated compounds 
based on said fitnesk function; and 

iii. refining said first sub-sVt of enumerated compounds based 
on said fitness function. \ 

35. The method of claim 34, wherein step (e.) composes 

i. initially selecting K compounds Y om sa id S enumerated 
compounds to produce a second sVb-set of enumerated 
compounds; \ 

ii. evaluating said second sub-set of enumerated compounds 
based on said fitness function; and \ 

iii. refining said second sub-set of enumerated compounds 
based on said fitness function. \ 
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36. The method of claim 35, wherein the fitness function is related to diversity of 
a collection of\compounds, and wherein step (b.ii.) comprises evaluating the 
diversity of said first sub-set of enumerated compounds, and wherein step (b.iii.) 
comprises refining, said first sub-set to increase the diversity of said first sub-set. 

37. The method of claim 36, wherein step (e.ii.) comprises evaluating the diversity 
of said S enumerated compounds, and wherein step (e.iii.) comprises refining said 
S enumerated compounds\to increase the diversity of said S enumerated 
compounds. \ 

38. The method of claim 32, wherem step (b.) comprises: 

i. characterizing each compound of said first set of 
enumerated compounds; 

ii . evaluating each characterized compound of said first set of 
enumerated compounds based on said fitness function; 

iii. ranking each characterized Compound of said first set of 
enumerated compounds; and \ 

iv. selecting M compounds of said ¥irst set of enumerated 
compounds based on said ranking. \ 

39. The method of claim 38, wherein step (e.) comprises: \ 

i. characterizing each compound of said S enumerated 
compounds; \ 
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ii. evaluating each characterized compound of said S 
enumerated compounds based on said fitness function; 

iii. ranking each characterized compound of said S 
enumerated compounds; and 

iv. selecting K compounds of said S enumerated compounds 
based omsaid ranking. 

40. The method of claim 39, wherein step (b.i.) comprises characterizing each 
compound of said first set of enumerated compounds using a set of molecular 
descriptors. \ 

41. The method of claim 40, wherein step fe.i.) comprises characterizing each 
compound of said S enumerated compounds using said set of molecular 
descriptors. \ 

42. The method of claim 38, wherein the fitness function is related to similarity 
to one or more query structures, and wherein step (b.iiA comprises evaluating 
similarity between each compound of said first set of enumerated compounds and 
the one or more query structures. \ 

43. The method of claim 42, wherein at least one of the following similarity 
measures is used in step (b.ii.) for evaluating similarity between eachVompound 
and the one or more query structures: \ 
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(1) \similarity in number of atoms, bonds and rings of 

me same types; 

(2) similarity in shape and surface characteristics; 

(3) similarity in electron density distribution; 

(4) similarity based on common substructure; 

(5) similarity based on the presence and orientation of 
pharmacoohoric groups; 

(6) similarity in\binding affinity; and 

(7) similarity in degree of conformational overlap with 
a know receptor Wider. 

44. The method of claim 42, wherein step (e.ii.) comprises evaluating similarity 
between each compound of said S enumerated compounds and the one or more 
query structures, and wherein the same similarity measure is used for evaluating 
similarity in step (b.ii.) and step (e.ii.). \ 

45 . The method of claim 38, wherein the fitness function is related to at least one 
desired characteristic, and wherein step (b.ii.) comprises evaluating each 
compound of said first set of enumerated compounds to determine an extent that 
each compound possesses the at least one desired characteristic. \ 

46. The method of claim 45, wherein the at least one desired characteristic 
comprises at least one of the following: \ 
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3 (r4 a desired physical property; 

4 (2) \ a desired chemical property; 

5 (3) \ a desired functional property; and 

6 (4) \ a desired bioactive property. 

1 47. A system for selecting, basea on a fitness function, one or more compounds 

2 from an enumerated virtual combinatorial library associated with R enumerated 
j3 compounds, comprising: \ 

%T\ means for selecting a first set of N enumerated from the R enumerated compounds 

fjB associated with the enumerated virtual combinatorial library, wherein R < 

5 N; \ 

!=j? means for selecting M compounds of said ffrst set of N enumerated compounds 

ij based on the fitness function, whereinW < N; 

fSi means for deconvoluting said M compounds into reagents; 

iri X 

10 means for extracting an enumerated focused library, based on said reagents, from 

1 1 the enumerated virtual combinatorial library, wherein said enumerated 

12 focused library includes S enumerated compounds; and 

13 means for selecting K compounds of said S enumerated compounds based on the 

14 fitness function, wherein K < S. \ 

1 4 8 . A computer program product comprising a computer useable medium having 

2 computer program logic recorded thereon for enabling a processor to assist in 

3 selecting, based on a fitness function, one or more compounds, from an 
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4 enumerated virtuaK combinatorial library associated with R enumerated 

5 compounds, the computer program logic comprising: 

6 means for enabling a processor to select a first set of N enumerated compounds 

7 from the R enumerated compounds associated with the enumerated virtual 

8 combinatorial library, wherein N < R; 

9 means for enabling a processor to select M compounds of said first set of 

10 enumerated compounds Based on the fitness function, wherein M < N; 

11 means for enabling a processor to de^onvolute said M compounds into reagents; 
|i means for extracting an enumerated focmsed library, based on said reagents, from 
Hj the enumerated virtual combinatorial library, wherein said enumerated 
P§ focused library includes S enumerated compounds; and 

V§ means for enabling a processor to select K compounds of said S enumerated 
t6 compounds based on the fitness function 5 Vvherein K < S. 



